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Title: SARS 

5 

The invention relates to the field of virology. 

Recently, a new virus has caused a global health risk because of its pathogenic 
effects in man combined with a relatively easily droplet transmission. The virus first 

10 was seen in the Chinese province Guangdong in February 2003, and within two 

months it has been able to spread to several countries all over the world where it has 
caused 78 deaths out of 2300 people infected (New Scientist Online News 13:25 02 
April 2003). The virus has been named SARS (Severe Acute Respiratory Syndrome) 
virus and causes a respiratory illness (atypical pneumonia) in man. This illness 

15 usually begins with a fever, sometimes associated with chills or other symptoms, 

including headache, a general feeling of discomfort and body aches. Some people also 
experience mild respiratory syndromes at the outset. 

After 2 to 7 days, SARS patients may develop a dry, nonproductive cough that 
might be accompanied or progress to the point where insuffiecient oxygen is getting 

20 to the blood. In 10% to 20% of the cases, patients will require mechanical ventilation, 
and eventually the disease can lead to the death of the patient. Hospital personnel, 
children, elderly and people having an underlying condition such as diabetes or heart 
disease, or a weakened immune system, form the highest risk group. Co-infection 
with other pathogens seems to occur frequently, especially with opportunistic 

25 pathogenic microorganisms such as human metapneumovirus (hMPV), Chlamydia, 
etcetera. 

The incubation time for the virus is typically 2-7 days and the disease is 
transmitted by people sich with SARS coughing or sneezing droplets in the air. 

As for yet it is not known if there is a cure for the disease. Several antiviral 
3 0 therapies have been applied, but with various results. 

Also, for being able to prevent spread of the disease, it is of great importance 
to be able to recognise the disease in an early stage. Only then sufficient measures 
can be taken to isolate patients and initiate quarantaine precautions. At this moment 
there is not yet a diagnostic tool in place. 
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Thus, there is great need in developing diagnostic tools and therapies for this 
disease. 

The invention provides the nucleotide sequence of an isolated essentially 
5 mammalian positive-sense single stranded RNA virus belonging to the 

Coronaviruses, which is the causative factor for SABS. From a phylogenetic analysis 
of the sequences of the virus (Fig. 1) it appears that the virus is an intermediate 
between the group formed by TGEV (transmissable gastroenetritis virus), PEDV 
(porcine epidemic diarrhea virus) and 229E (human coronavirus 229E) at one side, 
10 the group formed by BoCo (bovine coronavirus) and MHV (murine hepatitis virus) at 
an other side, and the AIBV (avian infectious bronchitis virus) on yet another side . 
In general, bovine coronavirus seems to be the closest relative (at least for the viral 
replicase protein). 

Although phylogenetic analyses provide a convenient method of identifying a 
15 virus as a SARS virus several other possibly more straightforward albeit somewhat 
more course methods for identifying said virus or viral proteins or nucleic adds from 
said virus are herein also provided. As a rule of thumb a SARS virus can be identified 
by the percentages of a homology of the virus, proteins or nucleic acids to be 
identified in comparison with viral proteins or nucleic acids identified herein by 
2 0 sequence. It is generally known that virus species, especially RNA virus species, often 
constitute a quasi species wherein a cluster of said viruses displays heterogeneity 
among its members. Thus it is expected that each isolate may have a somewhat 
different percentage relationship with the sequences of the isolate as provided herein. 
When one wishes to compare a virus isolate with the sequences as listed in 

2 5 figure 2, the invention provides an isolated essentially mammalian positive-sense 

single stranded RNA virus (SARS) belonging to the Coronaviruses and identifiable as 
phylogenetically corresponding thereto by determining a nucleic acid sequence of said 
virus and determining that said nucleic acid sequence has a percentage nucleic acid 
identity to the sequences as listed higher than the percentages identified herein for 

3 0 the nucleic acids as identified herein below in comparison with BoCo, AIPV and 

PEDV. Likewise, an isolated essentially mammalian positive-sense single stranded 
RNA virus (SARS) belonging to the Coronaviruses and identifiable as 
phylogenetically corresponding thereto by determining an amino acid sequence of 
said virus and determining that said amino acid sequence has a percentage amino 
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acid homology to the sequences as listed which is essentially higher than the 
percentages provided herein in comparison with BoCo, AIPV and PEDV. 

With the provision of the sequence information of this SARS virus, the 
5 invention provides diagnostic means and methods and therapeutic means and 

methods to he employed in the diagnosis and/or treatment of disease, in particular of 
respiratory disease (atypical pneumonia), in particular of mammals, more in 
particular in humans. In virology, it is most advisory that diagnosis and/or treatment 
of a specific viral infection is performed with reagents that are most specific for said 

10 specific virus-causing said infection. In this case this means that it is preferred that 
said diagnosis and/or treatment of a SARS virus infection is performed with reagents 
that are most specific for SARS virus. This by no means however excludes the 
possibility that less specific, but sufficiently cross-reactive reagents are used instead, 
for example because they are more easily available and sufficiently address the task 

15 at hand. 

The invention for example provides a method for virologically diagnosing a SARS 
infection of an animal, in particular of a mammal, more in particular of a human 
being, comprising determining in a sample of said animal the presence of a viral 
isolate or component thereof by reacting said sample with a SARS specific nucleic 
2 0 acid a or antibody according to the invention, and a method for serologically 

diagnosing a SARS infection of a mammal comprising deter m i nin g in a sample of said 
mammal the presence of an antibody specifically directed against a SARS virus or 
component thereof by reacting said sample with a SARS virus-specific proteinaceous 
molecule or fragment thereof or an antigen according to the invention. 

2 5 The invention also provides a diagnostic kit for diagnosing a SARS infection 

comprising a SARS virus, a SARS virus-specific nucleic acid, proteinaceous molecule 
or fragment thereof antigen and/or an antibody according to the invention, and 
preferably a means for detecting said SARS virus, SARS virus-specific nucleic acid, 
proteinaceous molecule or fragment thereof, antigen and/or an antibody, said means 

3 0 for example comprising an excitable group such as a fluorophore or enzymatic 

detection system used in the art (examples of suitable diagnostic kit format comprise 
IF, ELISA, neutralization assay, RT-PCR assay). To determine whether an as yet 
unidentified virus component or synthetic analogue thereof 6uch as nucleic acid, 
proteinaceous molecule or fragment thereof can be identified as SARS-virus-specffic, 
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it suffices to analyse the nucleic acid or amino acid sequence of said component, for 
example for a stretch of said nucleic acid or amino acid, preferably of at least 10, 
more preferably at least 25, more preferably at least 40 nucleotides or amino acids 
(respectively), by sequence homology comparison with the provided SABS viral 
5 sequences and with known non-SARS viral sequences (BoCo is preferably used) using 
for example phylogenetic analyses as provided herein. Depending on the degree of 
relationship with said SARS or non-SARS viral sequences, the component or 
synthetic analogue can be identified. 

10 

The invention thus provides the nucleotide sequence of a novel etiological 
agent, an isolated essentially mammalian positive-sense single stranded RNA virus 
(herein also called SARS virus) belonging to the Coronaviruses, and SARS virus- 
specific components or synthetic analogues thereof Coronaviruses were first isolated 

15 from chickens in 1937, while the first human coronavirus was propagated in vitro by 
Tyrell and Bonoe in 1965. There are now about 13 species in this family, which infect 
cattle, pigs, rodents, cats, dogs, birds and man. Coronaviruses particles are 
irregularly shaped, about 60-220 nm in diameter, with an outer envelope bearing 
distinctive, 'club-shaped 1 peplomers ( about 20 nm long and 10 run wide at the distal 

20 end). This 'crown-like' appearance give the family its name. The envelope carries two 
glycoproteins: S, the spike glycoprotein which is involved in cell fusion and is a major 
antigen, and M, the membrane glycoprotein, which is involved in budding and 
envelope formation. The genome is associated with a basic phosphoprotein, 
designated N. The genome of coronaviruses, a single stranded positive-sense RNA 

2 5 strand, is typically 27-31 Kb long and contains a 5 1 methylated cap and a 3 1 poly-A 

tail, by which it can directly function as an mRNA in the infected cell. Initially the 5' 
ORF 1 (about 20 Kb) is translated to produce a viral polymerase, which then produces 
a full length negative sense strand. This is used as a template to produce mRNA as a 
'nested set 1 of transcripts, all with identical 5' non-translated leader sequence of 72 

3 0 nucleotides and coincident 3 ! polyadenylated ends. Each mRNA thus produced is 

monocistronic, the genes at the 5 1 end being translated from the longest mRNA and 
so on. These unusual cytoplasmic structures are produced not by splicing, but by the 
polymerase during transcription. Between each of the genes there is a repeated 
intergenic sequence - AACUAAAC — which interacts with the transcriptase plus 
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cellular factors to splice the leader sequence onto the start of each ORF. In some 
coronaviruses there are about 8 ORFs, coding for the proteins mentioned above, but 
also for a heamagglutenin esterase (HE), and several other non-structural proteins. 
Newly isolated viruses are phylogenetically corresponding to and thus taxonomically 
5 corresponding to SARS virus when comprising a gene order and/or amino acid 
sequence and/or nucleotide sequence sufficiently similar to our prototypic SARS 
virus.. The highest amino acid sequence homology, and defining the structural 
correspondence on the individual protein level, between SARS virus and any of the 
known other viruses of the same family to date (BoCo?) is for parts of the polymerase 

1 0 protein xx-xx%, as can be deduced when comparing the sequences given in figure 2 
with sequences of other viruses, in particular of BoCo. Individual proteins or whole 
virus isolates with, respectively, higher homology to these mentioned maximum 
values are considered phylogenetically corresponding and thus taxonomically 
corresponding to SARS virus, and comprise a nucleic acid sequence structurally 

15 corresponding with a sequence as shown in figure 2. Herewith the invention provides 
a virus phylogenetically corresponding to the isolated virus of which the sequences 
are depicted in figure 2. 

It should be noted that, similar to other viruses, a certain degree of variation can be 
expected to be found between different isolated SARS-viruses. 

2 0 Also, the viral sequence of the SARS virus or an an isolated SARS virus gene as 

provided herein for example shows less than 95%, preferably less than 90%, more 
preferably less than 80%, more preferably less than 70% and most preferably less 
than 65% nucleotide sequence homology or less than 95%, preferably less than 90%, 
more preferably less than 80%, more preferably less than 70% and most preferably 
25 less than 65% amino acid sequence homology with the respective nucleotide or amino 
acid sequence of the bovine coronavirus or the murine hepatitis virus as for example 
can be found in Genbank. 

Sequence divergence of SARS strains around the world may be somewhat higher, in 
analogy with other coronaviruses. 

3 0 The term "nucleotide sequence homology" as used herein denotes the presence of 

homology between two (polynucleotides. Polynucleotides have "homologous" 
sequences if the sequence of nucleotides in the two sequences is the same when 
aligned for maximum correspondence. Sequence comparison between two or more 
polynucleotides is generally performed by comparing portions of the two sequences 
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over a comparison window to identify and compare local regions of sequence 
similarity. The comparison window is generally from about 20 to 200 contiguous 
nucleotides. The "percentage of sequence homology" for polynucleotides, such as 50, 
60, 70, 80, 90, 95, 98, 99 or 100 percent sequence homology may be determined by 
5 comparing two optimally aligned sequences over a comparison window, wherein the 
portion of the polynucleotide sequence in the comparison window may include 
additions or deletions (i.e. gaps) as compared to the reference sequence (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by: (a) determining the number of positions at which the 

10 identical nucleic acid base occurs in both sequences to yield the number of matched 
positions; (b) dividing the number of matched positions by the total number of 
positions in the window of comparison; and (c) multiplying the result by 100 to yield 
the percentage of sequence homology. Optimal alignment of sequences for comparison 
may be conducted by computerized implementations of known algorithms, or by 

15 inspection. Readily available sequence comparison and multiple sequence alignment 
algorithms are, respectively, the Basic Local Alignment Search Tool (BLAST) 
(Altschul, S.F. et al. 1990. J. Mol. Biol. 215:403; Altschul, S.F. et al. 1997. Nucleic 
Acid Res. 25:3389-3402) and ClustalW programs both available on the internet. Other 
suitable programs include GAP, BESTFIT and FASTA in the Wisconsin Genetics 

20 Software Package (Genetics Computer Group (GCG), Madison, WI, USA). 
As used herein, "substantially complementary" means that two nucleic acid 
sequences have at least about 65%, preferably about 70%, more preferably about 80%, 
even more preferably 90%, and most preferably about 98%, sequence 
complementarity to each other. This means that the primers and probes must exhibit 

25 sufficient complementarity to their template and target nucleic acid, respectively, to 
hybridise under stringent conditions. Therefore, the primer sequences as disclosed in 
this specification need not reflect the exact sequence of the binding region on the 
template and degenerate primers can be used. A substantially complementary primer 
sequence is one that has sufficient sequence complementarity to the amplification 

3 0 template to result in primer binding and second-strand synthesis. 

The term "hybrid" refers to a double-stranded nucleic acid molecule, or duplex, 
formed by hydrogen bonding between complementary nucleotides. The terms 
"hybridise" or "anneal" refer to the process by which single strands of nucleic acid 
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sequences form double-helical segments through hydrogen bonding between 
complementary nucleotides. . 

The term "oligonucleotide" refers to a short sequence of nucleotide monomers (usually 
6 to 100 nucleotides) joined by phosphorous linkages (e.g., phosphodiester, alkyl and 
5 aryl-phosphate, phosphorothioate), or non-phosphorous linkages (e.g., peptide, 

sulfamate and others). An oligonucleotide may contain modified nucleotides having 
modified bases (e.g., 5-methyl cytosine) and modified sugar groups (e.g., 2 , -0-methyl 
ribosyl, 2'-0-methoxyethyl ribosyl, 2'-fluoro ribosyl, 2'-amino ribosyl, and the like). 
Oligonucleotides may be naturaUy-occurring or synthetic molecules of double- and 

1 0 single-stranded DNA and double- and single-stranded UNA with circular, branched 
or linear shapes and optionally including domains capable of forming stable 
secondary structures (e.g., stem-and-loop and loop-stem-loop structures). 
The term "primer" as used herein refers to an oligonucleotide which is capable of 
annealing-to the amplification target allowing a DNA polymerase to attach thereby 

15 serving as a point of initiation of DNA synthesis when placed under conditions in 

which synthesis of primer extension product which is complementary to a nucleic acid 
strand is induced, i.e., in the presence of nucleotides and an agent for polymerization 
such as DNA polymerase and at a suitable temperature and pH. The (amplification) 
primer is preferably single stranded for maximum efficiency in amplification. 

2 0 Preferably, the primer is an oligodeoxy ribonucleotide. The primer must be 

sufficiently long to prime the synthesis of extension products in the presence of the 
agent for polymerization. The exact lengths of the primers will depend on many 
factors, including temperature and source of primer. A "pair of bi-directional primers" 
as used herein refers to one forward and one reverse primer as commonly used in the 

2 5 art of DNA amplification such as in PCR amplification. 

The term "probe" refers to a single-Btranded oligonucleotide sequence that will 
recognize and form a hydrogen-bonded duplex with a complementary sequence in a 
target nucleic acid sequence analyte or its cDNA derivative. 

The terms "stringency" or "stringent hybridization conditions" refer to hybridization 

3 0 conditions that affect the stability of hybrids, e.g., temperature, salt concentration, 

pH, formamide concentration and the like. These conditions are empirically optimised 
to maximize specific binding and minimize non-specific binding of primer or probe to 
its target nucleic acid sequence. The terms as used include reference to conditions 
under which a probe or primer will hybridise to its target sequence, to a detectably 
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greater degree than other sequences (e.g. at least 2-fold over background). Stringent 
conditions are sequence dependent and will be different in different circumstances. 
Longer sequences hybridise specifically at higher temperatures. Generally, stringent 
conditions are selected to be about 5°C lower than the thermal melting point (Tm) for 
the specific sequence at a defined ionic strength and pH. The Tm is the temperature 
(under defined ionic strength and pH) at which 50% of a complementary target 
sequence hybridises to a perfectly matched probe or primer. Typically, stringent 
conditions will be those in which the salt concentration is less than about 1.0 M Na+ 
ion, typically about 0.01 to 1.0 M Na+ ion concentration (or other salts) at pH 7.0 to 
8.3 and the temperature is at least about 30°C for short probes or primers (e.g. 10 to 
50 nucleotides) and at least about 60°C for long probes or primers (e.g. greater than 
50 nucleotides). Stringent conditions may also be achieved with the addition of 
destabilizing agents such as formamide. Exemplary low stringent conditions or 
"conditions of reduced stringency" include hybridization with a buffer solution of 30% 
formamide, 1 M NaCl, 1% SDS at 37°C and a wash in 2x SSC at 40°C. Exemplary 
high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% 
SDS at 37°C, and a wash in O.lx SSC at 60°C. Hybridization procedures are well 
known in the art and are described in e.g. Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons Inc., 1994. 

The term "antibody" includes reference to antigen binding forms of antibodies (e. g., 
Fab, F (ab) 2). The term "antibody 1 * frequently refers to a polypeptide substantially 
encoded by an immunoglobulin gene or immunoglobulin genes, or fragments thereof 
which specifically bind and recognize an analyte (antigen). However, while various 
antibody fragments can be defined in terms of the digestion of an intact antibody, one 
of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, 
as used herein, also includes antibody fragments such as single chain Fv, chimeric 
antibodies (i. e., comprising constant and variable regions from different species), 
humanized antibodies (i. e., comprising a complementarity determining region (CDR) 
from a non-human source) and heteroconjugate antibodies (e. g., bispecific 
antibodies). 

In short, the invention provides an isolated essentially mammalian positive- 
sense single stranded UNA virus (SABS) belonging to the Coronaviruses and 
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identifiable as phylogenetically corresponding thereto by determining a nucleic acid 
sequence of a suitable fragment of the genome of said virus and testing it in 
phylogenetic tree analyses wherein maximum likelihood trees are generated using 
100 bootstraps and 3 jumbles and finding it to be more closely phylogenetically 
5 corresponding to a virus isolate having the sequences as depicted in figure 2 than it is 
corresponding to a virus isolate of BoCo (bovine coronavirus), MHV (murine hepatitis 
virus), AIBV (avian infectious bronchitis virus), PEDV (porcine epidemic diarrhea 
virus), TGEV (transmissible gastroenteritis virus) or 229E (human coronavirus 
229E). 

10 Suitable nucleic acid genome fragments each useful for such phylogenetic tree 

analyses are for example any of the RAP-PCR fragments 1 to 12 as disclosed in 
figure 2, leading to the phylogenetic tree analysis as disclosed herein in figure 1. 
A suitable open reading frame (ORF) comprises the ORF encoding the viral 
polymerase (ORF la). When an overall amino acid identity of at least 91%, preferably 

15 of at least 95% of the analysed polymerase with the polymerase having a sequence 
comprising the fragments EMC-1, EMC-2, EMC-3, EMC-4 and EMC- 5 of figure 2 is 
found, the analysed virus isolate comprises a SARS virus isolate according to the 
invention. 

Another suitable open reading frame (ORF) useful in phylogenetic analyses 
20 comprises the ORF encoding the N protein. When an overall amino acid identity of at 
least 70%, preferably of at least 85% of the analysed N-protein with the N-protein 
encoded by a sequence comprising the sequence EMC-8 of figure 2 is found, the 
analysed virus isolate comprises a SARS isolate according to the invention. 

2 5 Another suitable open reading frame (ORF) useful in phylogenetic analyses 

comprises the ORF encoding the spike protein S. When an overall amino acid identity 
of at least 95%, preferably of at least 97% of the analysed S-protein with the sequence 
MFIFLLFLTLTSGSDLDRCT of the S-protein of isolate HK39849 is 

found, the analysed virus isolate comprises a SARS virus isolate according to the 

3 0 invention. The S ORF of the SARS virus seems to be located adjacent to the ORF lab 

(coding for the viral polymerase), which would discriminate SARS viruses from the 
bovine coronavirus and the murine hepatitis virus, which have a so-called 2a gene 
and an HE-gene between the S protein and the viral polymerase. 
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The invention provides among others an isolated or recombinant nucleic acid 
or virus-specific functional fragment thereof obtainable from a virus according to the 
invention. In particular, the invention provides primers and/or probes suitable for 
identifying a SARS virus nucleic acid. 
5 Furthermore, the invention provides a vector comprising a nucleic acid according to 
the invention. To begin with, vectors such as plasmid vectors containing (parts of) the 
genome of SARS virus, virus vectors containing (parts of) the genome of SARS (for 
example, but not limited thereto, vaccinia virus, retroviruses, baculovirus), or SARS 
virus containing (parts of) the genome of other viruse or other pathogens are 
10 provided. 

Also, the invention provides a host cell comprising a nucleic acid or a vector according 
to the invention. Plasmid or viral vectors containing the polymerase components of 
SARS virus are generated in prokaryotic cells for the expression of the components in 
relevant oell types (bacteria, insect cells, eukaryotic cells). Plasmid or viral vectors 
15 containing full-length or partial copies of the SARS virus genome will be generated in 
prokaryotic cells for the expression of viral nucleic acids in- vitro or in-vivo. The latter 
vectors may contain other viral sequences for the generation of chimeric viruses or 
chimeric virus proteins, may lack parts of the viral genome for the generation of 
replication defective virus, and may contain mutations, deletions or insertions for the 

2 0 generation of attenuated viruses. 

Infectious copies of SARS virus (being wild type, attenuated, replication-defective or 
chimeric) can be produced upon co-expression of the polymerase components 
according to the state-of-the-art technologies described above. 

In addition, eukaryotic cells, transiently or stably expressing one or more full-length 
25 or partial SARS virus proteins can be used. Such cells can be made by transfection 
(proteins or nucleic acid vectors), infection (viral vectors) or transduction (viral 
vectors) and may be useful for complementation of mentioned wild type, attenuated, 
replication-defective or chimeric viruses. 

A chimeric viru6 may be of particular use for the generation of recombinant 

3 0 vaccines protecting against two or more viruses. For example, it can be envisaged 

that a SARS virus vector expressing one or more proteins of a human 
metapneumovirus or a human metapneumovirus vector expressing one or more 
proteins of SARS virus will protect individuals vaccinated with such vector against 
both virus infections. Such a specific chimeric virus is particularly useful in the 
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invention because it is suspected that co-infection o£ for instance, human 
metapneumovirus frequently occurs in SARS virus infetced patients. Attenuated and 
replication-defective viruses may be of use for vaccination purposes with live vaccines 
as has been suggested for other viruses. 
5 In a preferred embodiment, the invention provides a proteinaceous molecule or 

coronavirus-specific viral protein or functional fragment thereof encoded by a nucleic 
acid according to the invention. Useful proteinaceous molecules are for example 
derived from any of the genes or genomic fragments derivable from a virus according 
to the invention. Such molecules, or antigenic fragments thereof; as provided herein, 

10 are for example useful in diagnostic methods or kits and in pharmaceutical 

compositions such as sub-unit vaccines. Particularly useful are the viral polymerase 
protein, the spike protein or antigenic fragments thereof for inclusion as antigen or 
subunit immunogen, but inactivated whole virus can also be used. Particulary useful 
are also those proteinaceous substances that are encoded by recombinant nucleic acid 

15 fragments that are identified for phylogenetic analyses, of course preferred are those 
that are within the preferred bounds and metes of ORFs useful in phylogenetic 
analyses, in particular for eliciting SABS virus specific antibodies, whether in vivo 
(e.g. for protective puposes or for providing diagnostic antibodies) or in vitro (e.g. by 
phage display technology or another technique useful for generating synthetic 

20 antibodies). 

Also provided herein are antibodies, be it natural polyclonal or monoclonal, or 
synthetic (e.g. (phage) library-derived binding molecules) antibodies that specifically 
react with an antigen comprising a proteinaceous molecule or SARS virus-specific 
functional fragment thereof according to the invention. Such antibodies are useful in 

25 a method for identifying a viral isolate as a SARS virus comprising reacting said viral 
isolate or a component thereof with an antibody as provided herein. This can for 
example be achieved by using purified or non-purified SARS virus or parts thereof 
(proteins, peptides) using ELISA, RIA, FACS or similar formats of antigen detection 
assays (Current Protocols in Immunology). Alternatively, infected cells or cell 

3 0 cultures may be used to identify viral antigens using classical immunofluorescence or 
immunohistochemical techniques. Specifically useful in this respect are antibodies 
raised against SARS virus proteins which are encoded by a nucleotide sequence 
comprising one or more of the fragments disclosed in figure 2. 
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Other methods for identifying a viral isolate as a SAES virus comprise reacting said 
viral isolate or a component thereof with a virus specific nucleic acid according to the 
invention. 

In this way the invention provides a viral isolate identifiable with a method according 
to the invention as a mammalian virus taxonomically corresponding to a positive- 
sense single stranded UNA virus identifiable as likely belonging to the SAKS virus 
genus within the family of Coronaviruses. 

The method is useful in a method for virologically diagnosing a SARS virus infection 
of a mammal, said method for example comprising determining in a sample of said 
mammal the presence of a viral isolate or component thereof by reacting said sample 
with a nucleic acid or an antibody according to the invention. 
Methods of the invention can in principle be performed by using any nucleic acid 
amplification method, such as the Polymerase Chain Reaction (PGR; Mullis 1987, 
U.S. Pat. No. 4,683,195, 4,683,202, en 4,800,159) or by using amplification reactions 
such as ligase Chain Reaction (LCR; Barany 1991, Proc. Natl. Acad. ScL USA 
88:189-193; EP Appl. No., 320,308), Self-Sustained Sequence Replication (3SR; 
Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), Strand Displacement 
Amplification (SDA; U.S. Pat. Nos. 5,270,184, en 5,455,166), Transcriptional 
Amplification System (TAS; Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173-1177), Q- 
Beta Replicase (Lizardi et aL, 1988, Bio/Technology 6:1197), Rolling Circle 
Amplification (RCA; U.S. Pat. No. 5,871,921), Nucleic Acid Sequence Based 
Amplification (NASBA), Cleavase Fragment Length Polymorphism (U.S. Pat. No. 
5,719,028), Isothermal and Chimeric Primer-initiated Amplification of Nucleic Acid 
(ICAN), Ramification-extension Amplification Method (RAM; U.S. Pat. Nos. 5,719,028 
and 5,942,391) or other suitable methods for amplification of nucleic acids. 
In order to amplify a nucleic acid with a small number of mismatches to one or more 
of the amplification primers, an amplification reaction may be performed under 
conditions of reduced stringency (e.g. a PCR amplification using an annealing 
temperature of 38°C, or the presence of 3.5 mM MgCl2). The person skilled in the art 
will be able to select conditions of suitable stringency. 

The primers herein are selected to be "substantially" complementary (i.e. at least 
65%, more preferably at least 80% perfectly complementary) to their target regions 
present on the different strands of each specific sequence to be amplified. It is 
possible to use primer sequences containing e.g. inositol residues or ambiguous bases 
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or even primers that contain one or more mismatches when compared to the target 
sequence. In general, sequences that exhibit At least 65%, more preferably at least 
80% homology with the target DNA or RNA oligonucleotide sequences, are considered 
suitable for use in a method of the present invention. Sequence mismatches are also 
5 not critical when using low stringency hybridization conditions. 

The detection of the amplification products can in principle be accomplished by any 
suitable method known in the art. The detection fragments may be directly stained or 
labelled with radioactive labels, antibodies, luminescent dyes, fluorescent dyes, or 
enzyme reagents. Direct DNA stains include for example intercalating dyes such as 

10 acridine orange, ethidium bromide, ethidium monoazide or Hoechst dyes. 

Alternatively, the DNA or UNA fragments may be detected by incorporation of 
labelled dNTP bases into the synthesized fragments. Detection labels which may be 
associated with nucleotide bases include e.g. fluorescein, cyanine dye or BrdUrd. 
When using a probe-based detection system, a suitable detection procedure for use in 

15 the present invention may for example comprise an enzyme immunoassay (EIA) 
format (Jacobs et al., 1997, J. Clin. Microbiol. 35, 791-795). For performing a 
detection by manner of the EIA procedure, either the forward or the reverse primer 
used in the amplification reaction may comprise a capturing group, such as a biotin 
group for immobilization of target DNA PCR amplicons on e.g. a streptavidin coated 

2 0 microliter plate wells for subsequent EIA detection of target DNA -amplicons (see 
below). The skilled person will understand that other groups for immobilization of 
target DNA PCR amplicons in an EIA format may be employed. 

Probes useful for the detection of the target DNA as disclosed herein preferably bind 
only to at least a part of the DNA sequence region as amplified by the DNA 

2 5 amplification procedure. Those of skill in the art can prepare suitable probes for 

detection based on the nucleotide sequence of the target DNA without undue 
experimentation as set out herein. Also the complementary nucleotide sequences, 
whether DNA or RNA or chemically synthesized analogs, of the target DNA may 
suitably be used as type-specific detection probes in a method of the invention, 

3 0 provided that such a complementary strand is amplified in the amplification reaction 

employed. 

Suitable detection procedures for use herein may for example comprise 
immobilization of the amplicons and probing the DNA sequences thereof by e.g. 
southern blotting. Other formats may comprise an EIA format as described above. To 
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facilitate the detection of binding, the specific amplicon detection probes may 
comprise a label moiety such as a fluorophore, a chromophore, an enzyme or a radio- 
label, so as to facilitate monitoring of binding of the probes to the reaction product of 
the amplification reaction. Such labels are well-known to those skilled in the art and 
include, for example, fluorescein isothiocyanate (FITC), (5-galactosidase, horseradish 
peroxidase, streptavidin, biotin, digoxigenin, 35S or 125L Other examples will be 
apparent to those skilled in the art. 

Detection may also be performed by a so called reverse line blot (RLB) assay, such as 
for instance described by Van den Brule et aL (2002, J. Clin- MicrobioL 40, 779-787). 
For this purpose RLB probes are preferably synthesized with a 5 1 amino group for 
subsequent immobilization on e.g. carboxyl-coated nylon membranes. The advantage 
of an RLB format is the ease of the system and its speed, thus allowing for high 
throughput sample processing. 

The use of nucleic acid probes for the detection of RNA or DNA fragments is well 
known in the art. Mostly these procedure comprise the hybridization of the target 
nucleic acid with the probe followed by post-hybridization washings. Specificity is 
typically the function of post-hybridization washes, the critical factors being the ionic 
strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm 
can be approximated from the equation of Meinkoth and Wahl, Anal. Biochem., 138: 
267-284 (1984): Tm = 81.5 °C + 16.6 (log M) + 0.41 (% GQ-0.61 (% form)-500/L; where 
M is the molarity of monovalent cations, % GC is the percentage of guanosine and 
cytosine nucleotides in the DNA, % form is the percentage of formamide in the 
hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the 
temperature (under defined ionic strength and pH) at which 50% of a complementary 
target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1 °C 
for each 1 % of mismatching; thus, The hybridization and/or wash conditions can be 
adjusted to hybridize to sequences of the desired identity. For example, if sequences 
with > 90% identity are sought, the Tm can be decreased 10 C. Generally, stringent 
conditions are selected to be about 5 C lower than the thermal melting point (Tm) for 
the specific sequence and its complement at a defined ionic strength and pH. 
However, severely stringent conditions can utilize a hybridization and/or wash at 
1,2,3, or 4 °C lower than the thermal melting point (Tm); moderately stringent 
conditions can utilize a hybridization and/or wash at 6,7,8,9, or 10 °C lower than the 
thermal melting point (Tm); low stringency conditions can utilize a hybridization 
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and/or wash at 11,12,13,14,15, or 20 °C lower than the thermal melting point (Tm). 
Using the equation, hybridization and wash compositions, and desired Tm, those of 
ordinary skill will understand that variations in the stringency of hybridization 
and/or wash solutions are inherently described. If the desired degree of mismatching 
5 results in a Tm of less than 45 °C (aqueous solution) or 32 °C (formamide solution) it 
is preferred to increase the SSC concentration so that a higher temperature can be 
used. An extensive guide to the hybridization of nucleic adds is found in Tijssen, 
Laboratory Techniques in Biochemistm and Molecular Biology — Hybridization with 
Nucleic Acid Probes, Part I, Chapter 2" Overview of principles of hybridization and 
10 the strategy of nucleic acid probe assays", Elsevier. New York (1993); and Current 
Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing 
and Wiley- Interscience, New York (1995). [is this the same for RNA-RNA or RNA- 
DNA hybrids?] 

In another aspect, the invention provides oligonucleotide probes for the generic 
15 detection of target UNA or DNA. The detection probes herein are selected to be 

"substantially" complementary to one of the strands of the double stranded nucleic 
acids generated by an amplification reaction of the invention. Preferably the probes 
are substantially complementary to the immobilizable, e.g. biotin labelled, antisense 
strands of the amplicons generated from the target RNA or DNA. 
2 0 It is allowable for detection probes of the present invention to contain one or more 

mismatches to their target sequence. In general, sequences that exhibit at least 65%, 
more preferably at least 80% homology with the target oligonucleotide sequences are 
considered suitable for use in a method of the present invention. 

Antibodies, both monoclonal and polyclonal, can also be used for detection purpose in 

2 5 the present invention, for example, in immunoassays in which they can be utilized in 

liquid phase or bound to a solid phase carrier. In addition, the monoclonal antibodies 
in these immunoassays can be detectably labeled in various ways. A variety of 
immunoassay formats may be used to select antibodies specifically reactive with a 
particular protein (or other analyte). For example, solid-phase EUSA immunoassays 

3 0 are routinely used to select monoclonal antibodies specifically immunoreactive with a 

protein. See Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor 
Publications, New York (1988), for a description of immunoassay formats and 
conditions that can be used to determine selective binding. Examples of types of 
immunoassays that can utilize antibodies of the invention are competitive and non- 
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competitive immunoassays in either a direct or indirect format. Examples of such 
immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) 
assay. Detection of the antigens using the antibodies of the invention can be done 
utilizing immunoassays that are run in either the forward, reverse, or simultaneous 
modes, including immunohistochemical assays on physiological samples. Those of 
skill in the art will know, or can readily discern, other immunoassay formats without 
undue experimentation. 

Antibodies can be bound to many different carriers and used to detect the presence of 
the target molecules. Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier can be 
either soluble or insoluble for purposes of the invention. Those skilled in the art will 
know of other suitable carriers for binding monoclonal antibodies, or will be able to 
ascertain -such using routine experimentation. 

The invention also provides a method for serologically diagnosing a SARS 
virus infection of a mammal comprising determining in a sample of said mammal the 
presence of an antibody specifically directed against a SARS virus or component 
thereof by reacting said sample with a proteinaceous molecule or fragment thereof or 
an antigen according to the invention 

Methods and means provided herein are particularly useful in a diagnostic kit 
for diagnosing a SARS virus infection, be it by virological or serological diagnosis. 
Such kits or assays may for example comprise a virus, a nucleic acid, a proteinaceous 
molecule or fragment thereof, an antigen and/or an antibody according to the 
invention. 

Use of a virus, a nucleic acid, a proteinaceous molecule or fragment thereof, an 
antigen and/or an antibody according to the invention is also provided for the 
production of a pharmaceutical composition, for example for the treatment or 
prevention of SARS virus infections and/or for the treatment or prevention of atypical 
pneumonia, in particular in humans. Attenuation of the virus can be achieved by 
established methods developed for this purpose, including but not limited to the use 
of related viruses of other species, serial passages through laboratory animals or/and 
tissue/cell cultures, site directed mutagenesis of molecular clones and exchange of 
genes or gene fragments between related viruses. 
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A pharmaceutical composition comprising a virus, a nucleic acid, a 
proteinaceous molecule or fragment thereof, an antigen and/or an antibody according 
to the invention can for example be used in a method for .the treatment or prevention 
of a SARS virus infection and/or a respiratory illness comprising providing an 
5 individual with a pharmaceutical composition according to the invention. This is most 
useful when said individual comprises a human. 

The invention also provides method to obtain an antiviral agent useful in the 
treatment of atypical pneumonia comprising establishing a cell culture or 
experimental animal comprising a virus according to the invention, treating said 

10 culture or animal with an candidate antiviral agent, and determining the effect of 

said agent on said virus or its infection of said culture or animal. An example of such 
an antiviral agent comprises a SARS virus-neutralising antibody, or functional 
component thereof as provided herein, but antiviral agents of other nature are 
obtained as well. The invention also provides use of an antiviral agent according to 

1 5 the invention for the preparation of a pharmaceutical composition, in particular for 
the preparation of a pharmaceutical composition for the treatment of atypical 
pneumonia, especifically when caused by a SARS virus infection, and provides a 
pharmaceutical composition comprising an antiviral agent according to the invention, 
useful in a method for the treatment or prevention of a SABS virus infection or 

2 0 atypical pneumonia, said method comprising providing an individual with such a 
pharmaceutical composition. 

The invention is further explained in the Examples without limiting it thereto. 
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Figure legends 

• * 

Fig. 1: Phylogenetic relationship for the nucleotide sequences of isolate HK39849 
5 with its closest relatives genetically. Phylogenetic trees were generated by maximum 
likelihood analyses using 100 bootstraps and 3 jumbles. The scale representing the 
number of nucleotide changes is shown for each tree. 

Fig. 2: Nucleotide sequences from 13 clones of parts of the SARS virus. Also included 
10 are the putative polypeptide sequences of polypeptides and alignmentsof the putative 
polypeptides with that of another member of the Coronoviridae family, where 
possible. 

Fig. 3: Schematic map of the SARS virus genome, indicating the position of the 
15 nucleotide sequences of figure 2 relative to the genome and a putative indication of 
the open reading frames of the genome based on analogy with other coronaviruses. 
EMC1-EMC12: sequences as provided in figure 2. CDC, BIN1-2: sequences known. 

Fig. 4: Amino acid comparison of the N-terminus of the S-protein of the SARS virus 
2 0 and closely related coronaviruses. HCV OC43 = human coronavirus isolate OC43; 
MHV A59 = murine hepatitis virus isolate A59, BCV = bovine corona virus. 
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Examples 

Virus isolation and characterisation 
Virus isolation and characterisation 
5 Isolate HK39849 was isolated from a hospitalised SARS patient by throat 

swab and inoculated into a culture of Vero-E6 cells. A sample of the supernatant from 
these infected cellswas used to inoculate VERO-118 cells and cell culture supernatant 
from these cells was aliquoted and frozen after one passage 

We isolated RNA from the virus-containing cell culture supernatant and 

1 0 subjected it to RNA arbitrarily primed PCJR (RAP-PCR) essentially as described by 

Welsh & McClelland (NAR 18:7213; PNAS USA 90:10710, 1993). Virus in the culture 
supernatants was purified on continuous 20-60% sucrose gradients. The gradient 
fractions were inspected for virus-like particles by EM, and RNA was isolated from 
the fraction containing , in which the most nucleocapsids were observed. Equivalent . 

1 5 amounts of RNA isolated from virus fractions were used for RAP-PCR, after which 
samples were run side by side on a 3% NuSieve agarose gel. Differentially displayed 
bands ranging in size from 200-1500 base pairs specific for the unidentified virus 
were subsequently purified from the gel, cloned in plasmid pCR2.1 (Invitrogen) and 
sequenced with vector-specific primers. When we used these sequences to search for 

2 0 homologies against sequences in the Genbank database using the BLAST software 

(www.ncbi.nlm.nih.gov/BLAST/) which yielded resemblance to virus sequences of the 
coronaviruses displayed in the phylogenetic tree of figure 1. 

Six of these fragments were located in the ORF coding for the viral polymerase (ORF 
lab), one (EMC-7) spanned the 3' end of ORFlab and reached into the 5' end of spike 

2 5 protein region, while an 8 th sequence (EMC8) spanned part of the Nucleocapsid 

coding sequence 

Phytogeny 

BLAST searches using nucleotide sequences obtained from the unidentified 

3 0 virus isolate revealed homologies primarily with members of the Coronaviridae. As 

an indication for the relation between the newly identified virus isolate and other 
coronaviruses a phylogenetic tree was constructed based on the sequence information 
obtained (figure 1). 
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Materials and Methods 



5 Specimen collection 

Virus was collected from SARS patients using throat swabs and from experimentally infected monkeys 
(throat and nasal swabs, serum, plasma and faeces) 

Virus isolation and culture 

1 0 Throat swabs were dipped into a culture of Vero-E6 cells and incubated for 1-4 days. Cell culture supernatant 
was clarified by centrifugation and filtered through a 0.45micrometre filter, before beings stored frozen. Hie 
virus was subsequently propagated in Vero-1 18 cells. 

Antigen detection by indirect IFA 

1 5 Samples from experimentally infected monkeys was cultured on Vero-118 cells in 24 well plates containing 
glass slides. These glass slides were washed with PBS and fixed in aceton for 1 minute at room temperature. 
After washing with PBS the slides were incubated for 30 minutes at 37 °C with SARS-antibody containing 
serum from SARS patients. After washing off the human serum in PBS, die slides were incubated at 37°C for 
30 minutes with FITC labeled anti-human antibodies. After three washes in PBS and one in tap water, the 

2 0 slides were included in a glycerol/PBS solution (Citifluor, UKC, Canterbury, UK) and covered. The slides 
were analysed using an Axioscop fluorescence microscope (Carl Zeiss B.V., Weesp, the Netherlands). 



Detection of antibodies in humans by indirect IFA 

2 5 Virus was cultured on Vero-1 18 cells in 24 well plates containing glass slides. These glass slides were washed 

with PBS and fixed in aceton for 1 minute at room temperature. After washing with PBS the slides were 
incubated for 30 minutes at 37 °C with SARS-antibody containing serum from SARS patients. After washing 
off the human serum in PBS, the slides were incubated at 37°C for 30 minutes with FITC labeled anti-human 
antibodies. After three washes in PBS and one in tap water, the slides were included in a glycerol/PBS 

3 0 solution (Citifluor, UKC, Canterbury, UK) and covered. The slides were analysed using an Axioscop 

fluorescence microscope (Carl Zeiss B.V., Weesp, the Netherlands 

Detection of antibodies in humans bvEUSA 
Patient samples^ 

35 4 samples of patients with SARS disease (S2: second sample of a pair, #3685, #3665, # 2528, #3668); 8 

samples of patients from routine serological virology (03-7809S, 03-78 10S, 03-7808S, 03-7807S, 03-7806S, 
02-10548S, 02-10566S, 03-7805S); an experimentally infected monkey (preserum and 9 days after infection 
ape 775). 

4 0 The Conjugate. 

The conjugate was tested in 3 concentrations (diluted in dilution buffer 9000-03, 1:100, 1:400 and 1:1600), 
both on polyvalent anti-IgM code MCB0201 (cross-reactive with monkey) and monoclonal anti-IgM, code 
9000-62 (non-crossreactive with monkey). 

4 5 Sera were diluted 1 :200 in serum diluent (code 9000-03), monkey 775 was diluted 1 : 1 00, 1 :200 and 1 :400. 

Serum incubation one hour at 37°C, conjugate incubation one hour at 37°C, and TMB (ready to use): 30 
minutes at room temperature. The reaction was stopped with sulphuric acid (0.5M). 

Results were interpreted by eye. Three of the four SARS-IgM positive sera (as detected by IF on infected 

5 0 cells) had a higher score than negative control sera.One serum had a score which was also reached by some of 

the negative controls. The monkey sera did not react Thus, this study shows that with direct conjugation the 
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developemnt of an IgM capture method is feasable. 
Virus characterisation 

For EM analyses, virus was concentrated from infected cell culture supernatants in a 
micro-centrifuge at 4 °C at 17000 x g, after which the pellet was resuspended in PBS 
and inspected by negative contrast EM 

RNA isolation 

RNA was isolated from the supernatant of infected cell cultures or sucrose gradient 
fractions using a High Pure RNA Isolation kit according to instructions from the 
manufacturer (Roche Diagnostics, Almere/The Netherlands). 

RT-PCR 

A one-step RT-PCR was performed in 50 ^il reactions containing 50 mM Tris.HCl pH 
8.5, 50 mM NaCl, 4 mM MgCk, 2 mM dithiotreitol, 200 jiM each dNTP, 10 units 
recombinant RNAsin (Promega, Leiden, the Netherlands), 10 units AMV RT 
(Promega, Leiden, The Netherlands), 5 units Amplitaq Gold DNA polymerase (PE 
Biosystems, Nieuwerkerk aan de Ijssel, The Netherlands) and 5 nl RNA. Cycling 
conditions were 45 min. at 42 °C and 7 min. at 95 °C once, 1 min at 95 °C, 2 min. at 
42 °C and 3 min. at 72 °C repeated 40 times and 10 min. at 72 °C once. 
Primers used for diagnostic PCJR: 
SARSfwd2: ggtggaacatcatccggtgat 
SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
RT-PCR, gel purification and direct sequencing were performed as described above. 

RAP-PCR 

RAP-PCR was performed essentially as described by Welsh & McClelland (Nuc. Acid 
Res. 18:7213, 1990; Proc. Natl. Acad. Sci. USA 90:10710 1993) . The oligonucleotide 
sequences are described in addenda 2. For the RT reaction, 2 ill RNA was used in a 
10 jul reaction containing 10 ng/jd oligonucleotide, 10 mM dithiotreitol, 500 \xm each 
dNTP, 25 mM Tris-HCl pH 8.3, 75 mM KC1 and 3 mM MgCb. The reaction mixture 
was incubated for 5 min. at 70 °C and 5 min. at 37 °C, after which 200 units 
Superscript RT enzyme (LifeTechnologies) were added. The incubation at 37 °C was 
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continued for 55 min. and the reaction terminated by a 5 min. incubation at 72 °C 
The RT mixture was diluted to give a 50 pi PCR reaction containing 8 ng/|xl 
oligonucleotide, 300 jxm each dNTP, 15 mM Tris-HCL pH 8.3, 65 mM KC1, 3.0 mM 
MgCLa and 5 units Taq DNA polymerase (PE Biosystems). Cycling conditions were 5 
5 min. at 94 °C, 5 min. at 40 °C and 1 min. at 72 °C once, followed by 1 min. at 94 °C, 2 
min. at 56 °C and 1 min. at 72 °C repeated 40 times and 5 min. at 72°C once. After 
RAP-PCR, 15 \il the RT-PCR products were run side by side on a 3% NuSieve agarose 
gel (FMC BioProducts, Heerhugowaard, The Netherlands). Differentially displayed 
fragments were purified from the gel with Qiaquick Gel Extraction kit (Qiagen, 
10 Leusden, The Netherlands) and cloned in pCR2.1 vector (Invitrogen, Groningen, The 
Netherlands) according to instructions from the manufacterer. 

Sequence analysis 

RAP-PCR~products cloned in vector pCR2.1 (Invitrogen) were sequenced with M13- 
15 specific oligonucleotides. DNA fragments obtained by RT-PCR were purified from 

agarose gels using Qiaquick Gel Extraction kit (Qiagen, Leusden, The Netherlands), 
and sequenced directly with the same oligonucleotides used for PCR. Sequence 
analyses were performed using a Dyenamic ET terminator sequencing kit 
(Amersham Pharmacia Biotech, Roosendaal, The Netherlands) and an ABI 373 
2 0 automatic DNA sequencer (PE Biosystem). All techniques were performed according 
to the instructions of the manufacturer. 



2 5 RT-PCR for diagnosing SARS virus. 

For the amplification of the SARS virus* genetic material, we used primers: 
SARS fwd2: ggtggaacatcatccggtgat 

3 0 SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
RT-PCR, gel purification and direct sequencing were performed as described above. 
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Phylogenetic analyses 

For all phylogenetic trees, DNA sequences were alligned using the ClustalW software 
package and maximum likelihood trees were generated rising the DNA-ML software 
package of the Phylip 3.5 program using 100 bootstraps and 3 jumbles 15 . Previously 
published sequences for TGEV, PEDV, 229E, AIBV, BoCo and MHV that were used 
for the generation of phylogenetic trees are available from Genbank 

Examples of methods to identify SARS virus 

Specimen collection 

In order to find virus isolates nasopharyngeal aspirates, throat and nasal swabs, 
broncheo alveolar lavages, serum and plasma samples, and stools preferably from 
mammals-such as humans, carnivores (dogs, cats, mustellits, seals etc.), horses, 
ruminants (cattle, sheep, goats etc.), pigs, rabbits, birds (poultry, ostriches, etc) 
should be examined. From birds cloaca swabs and droppings can be examined as well. 
Sera should be collected for immunological assays, such as ELISA, molecular-based 
assays, such as RT-PCR and virus neutralisation assays. 
Collected virus specimens were diluted with 5 ml Dulbecco MEM medium 
(BioWhittaker, Walkersville, MD) and thoroughly mixed on a vortex mixer for one 
minute. The suspension was thus centrifiiged for ten minutes at 840 x g. The 
sediment was spread on a multispot slide (Nutacon, Leimuiden, The Netherlands) for 
immunofluorescence techniques, and the supernatant was used for virus isolation. 



Virus isolation 

For virus isolation Vero-118 cells or tMK cells (RIVM, Bilthoven, The Netherlands) 
were cultured in 24 well plates containing glass slides (Costar, Cambridge, UK), with 
the medium described below supplemented with 10% fetal bovine serum 
(BioWhittaker, Vervier, Belgium). Before inoculation the plates were washed with 
PBS and supplied with Eagle's MEM with Hanks' salt (ICN, Costa mesa, CA) 
supplemented with 0.52/liter gram NaHCOs, 0.025 M Hepes (Biowhittaker), 2 mM L- 
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glutamine (Biowhittaker), 200 units/liter penidlline, 200 ng/liter streptomycine 
(Biowhittaker), 1 gram/liter lactalbumine (Sigma-Aldrich, Zwijndrecht, The 
Netherlands), 2.0 gram/liter D-glucose (Merck, Amsterdam, The Netherlands), 10 
gram/liter peptone (Oxoid, Haarlem, The Netherlands) and 0.02% trypsine (life 
5 Technologies, Bethesda, MD). The plates were inoculated with supernatant of the 
patient samples, 0,2 ml per well in triplicate, followed by centrifuging at 840x g for 
one hour. After inoculation the plates were incubated at 37 °C for a maximum of 1-3 
days and cultures were checked daily for CPE.. Extensive CPE was generally observed 
within 24hours. and included detachment of cells from the monolayer.. 

10 

Virus culture ofSARS 

Sub-confluent monolayers of tMK cells or Vero clone 118 cells in media as described 
above were inoculated with supernatants of samples that displayed CPE or with 
samples taken from patient or artificially infected monkeys.-. 

15 

Virus characterisation 

For EM analyses, virus was concentrated from infected cell culture supernatants in a 
micro-centrifuge at 4 °C at 17000 x g, after which the pellet was resuspended in PBS 
and inspected by negative contrast EM. 
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Antigen detection by indirect IFA 

25 

Virus was cultured on Vero- 118 cells in 24 well slides containing glass slides. These 
glass slides were washed with PBS and fixed in aceton for 1 minute at room 
temperature. 

After washing with PBS the slides were incubated for 30 minutes at 37 °C with SAR8 
3 0 patient serum. We used patient serum, but antibodies can be raised in various 
animals, such as ferrets, rabbits and mice (for monoclonal antibodies), and the 
working dilution of the antibody can vary for each immunisation. After three washes 
with PBS and one wash with tap water, the slides were incubated at 37°C for 30 
minutes with FITC labeled goat-anti-human antibodies .After three washes in PBS 
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and one in tap water, the slides were included in a glycerol/PBS solution (Citifluor, 
UKC, Canterbury, UK) and covered. The slides were analysed using an Axioscop 
fluorescence microscope (Carl Zeiss B.V., Weesp, the Netherlands). 

5 Detection of antibodies in humans by indirect JFA 

For the detection of virus specific antibodies, SAR3 virus-infected Vero cells were 
fixed with acetone on coverslips (as described above), washed with PBS and 
incubated 30 minutes at 37°C with serum samples at a 1 to 16 dilution. After two 
washes with PBS and one with tap water, the slides were incubated 30 minutes at 
10 37°C with FITC-labelled secohdaiy antibodies to human antibodies (Dako). Slides 
were processed as described above. 

Antibodies can be labelled directly with a fluorescent dye, which will result in a direct 
immuno fluorescence assay. FITC can be replaced with any fluorescent dye. This 
technique-can be applied to antibodies in other animals such as mammals, 
15 ruminants, birds or other species, assuming the secondary antibody to the 
appropriate species is used. 

Animal immunisation 

Cynomologous macaque specific antisera for the newly discovered virus were 
2 0 generated by experimental intratrachael installation of cultured virus of 

Cynomologous macaques. One to two later the animals were bled. The sera were 
tested for reactivity to SARS virus by indirect 1FA as described above; uninfected 
control cells were used to ensure the specificity of the serum. Other animal species 
are also suitable for the generation of specific antibody preparations and other 

2 5 antigen preparations may be used. 

RNA isolation 

3 0 RNA was isolated from the supernatant of infected cell cultures or sucrose gradient 

fractions using a High Pure RNA Isolation kit according to instructions from the 
manufacturer (Roche Diagnostics, Almere, The Netherlands). RNA can also be 
isolated following other procedures known in the field (Current Protocols in Molecular 
Biology). 
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RT-PCR 

A one-step RT-PCR was performed in 50 ^1 reactions containing 50 mM Tris.HCl pH 
5 8.5, 50 mM NaCl, 4 mM MgCh, 2 mM dithiotreitol, 200 pM each dNTP, 10 units 
recombinant RNAsin (Promega, Leiden, the Netherlands), 10 units AMV RT 
(Promega, Leiden, The Netherlands), 5 units Amplitaq Gold DNA polymerase (PE 
Biosystems, Nieuwerkerk aan de Ijssel, The Netherlands) and 5 yl RNA. Cycling 
conditions were 45 min. at 42 °C and 7 min. at 95 °C once, 1 min at 95 °C, 2 min. at 
10 42 °C and 3 min. at 72 °C repeated 40 times and 10 min. at 72 °G once. 
Primers used for diagnostic PCR: 

For the amplification of the SARS virus' genetic material, we used primers: 
SARS fwd2: ggtggaacatcatccggtgat 
15 SARS rev2: agcctgtgttgtagattgcgg 

These primers amplify a 149bp fragment of the polymerase gene (orf lab) 
RT-PCR, gel purification and direct sequencing were performed as described above. 

Sequence analysis 

2 0 Sequence analyses were performed using a Dyenamic ET terminator sequencing kit 
(Amersham Pharmacia Biotech, Roosendaal, The Netherlands) and an ABI 373 
automatic DNA sequencer (PE Biosystem). All techniques were performed according 
to the instructions of the manufacturer. PCR fragments were sequenced directly with 
the same oligonucleotides used for PCR, or the fragments were purified from the gel 

2 5 with Qiaquick Gel Extraction kit (Qiagen, Leusden, The Netherlands) and cloned in 
pCR2.1 vector (Invitrogen, Groningen, The Netherlands) according to instructions 
from the manufacturer and subsequently sequenced with M13-specific 
oligonucleotides . 
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Detection of antibodies in humans, mammals, ruminants or other animals by ELISA 
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. A recombinant protein derived from the SARS virus is preferred as the antigen. 
However, purified nucleocapsids may also be used* Antigens suitable for antibody 
detection include any SARS protein that combines with any SARS-specific antibody of 
a patient exposed to or infected with SARS virus. Preferred antigens of the invention 
include those that predominantly engender the immune response in patients exposed 
to SARS, which therefore, typically are recognised most readily by antibodies of a 
patient. Particularly preferred antigens include the N, and S proteins of SARS. 
Antigens used for immunological techniques can be native antigens or can be 
modified versions thereof. Well known techniques of molecular biology can be used to 
"alter the amino acid sequence of a SARS antigen to produce modified versions of the 
antigen that may be used in immunologic techniques. 

Methods for cloning genes, for manipulating the genes to and from expression 
vectors, and for expressing the protein encoded by the gene in a heterologous host are 
well-known, and these techniques can be used to provide the expression vectors, host 
cells, and the for expressing cloned genes encoding antigens in a host to produce 
recombinant antigens for use in diagnostic assays. See for instance: Molecular 
cloning, A laboratory manual and Current Protocols in Molecular Biology. 
A variety of expression systems may be used to produce SARS antigens. For instance, 
a variety of expression vectors suitable to produce proteins in E.Coli, B.subtilis, 
yeast, insect cells and mammalian cells have been described, any of which might be 
used to produce a SARS antigen suitable to detect anti- SARS antibodies in exposed 
patients. 

The baculovirus expression system has the advantage of providing necessary 
processing of proteins, and is therefor preferred. The system utilizes the polyhedxin 
promoter to direct expression of SARS antigens. (Matsuura et al. 1987, J.Gen.Virol. 
68: 1233-1250). 

Antigens produced by recombinant baculo-viruses can be used in a variety of 
immunological assays to detect anti- SARS antibodies in a patient. It is well 
established, that recombinant antigens can be used in place of natural virus in 
practically any immunological assay for detection of virus specific antibodies. 
The assays include direct and indirect assays, sandwich assays, solid phase assays 
such as those using plates or beads among others, and liquid phase assays. Assays 
suitable include those that use primary and secondary antibodies, and those that use 
antibody binding reagents such as protein A. Moreover, a variety of detection 



28 



methods can be used in the invention, including colorimetric, fluorescent, 
phosphorescent, chemiluminescent, luminescent and radioactive methods. 

5 
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Claims 



0 9 C/ 



l. 




10 



15 



20 



25 



(SARS) comprising one or more of the sequences of figure 2. 

2. An isolated positive-sense single stranded RNA virus (SARS) belonging to the 
Coronaviruses and identifiable as phylogenetically corresponding thereto by 
determining a nucleic acid sequence of said virus and testing it in phylogenetic tree 
analyses wherein maximum likelihood trees are generated using 100 bootstraps and 
3 jumbles and findingdtto be more closely phylogenetically corresponding-to a virus 
isolate having the sequences as depicted in figure 2 than it is corresponding to a virus 
isolate of BoCo (bovine coronavirus), MHV (murine hepatitis virus), AIBV (avian 
infectious bronchitis virus), PEDV (porcine epidemic diarrhea virus), TGEV 
(transmissible gastroenteritis virus) or 22 9E (human coronavirus 229E).. 

3. A virus according to claim 1 or 2 wherein said nucleic acid sequence comprises 
an open reading frame (ORF) encoding a viral protein of said virus. 

4. A virus according to claim 3 wherein said open reading frame is selected from : 
the group of ORFs encoding the viral replicase, nuclear capsid protein and the spike 
protein. 

5. A virus according to claim 1-4 isolatable from a human with atypical 
pneumonia. 

6. An isolated or recombinant nucleic acid or SARS virus-specific functional 
fragment thereof obtainable from a virus according to anyone of claims 1 to 5. 

7. A vector comprising a nucleic acid according to claim 6. 

8. A host cell comprising a nucleic acid according to claim 6 or a vector according 



to claim 7. 



9. An isolated or recombinant proteinaceous molecule or SARS virus-specific 
functional fragment thereof encoded by a nucleic acid according to claim 6. 

10. An antigen comprising a proteinaceous molecule or SARS virus-specific 
functional fragment thereof according to claim 9. 

11. An antibody specifically directed against an antigen according to claim 10- 

12. A method for identifying a viral isolate as a SARS vims comprising reacting 
said viral isolate or a component thereof with an antibody according to claim 11. 

13. A method for identifying a viral isolate as a SARS virus comprising reacting 
said viral isolate or a component thereof with a nucleic acid according to claim 6. 

14. A method for virologically diagnosing a SARS infection of a mammal 
comprising determining in a sample of said mammal the presence of a viral isolate or 
component thereof by reacting said sample with a nucleic acid according to claim 6 or 
an antibody according to claim 11. 

15. A method for serologically diagnosing a SARS infection of a mammal 
comprising determining in a sample of said mammal the presence of an antibody 
specifically directed against a SARS virus or component thereof by reacting said 
sample with a proteinaceous molecule or fragment thereof according to claim 9 or an 
antigen according to claim 10. 

16. A diagnostic kit for diagnosing a SARS infection comprising a virus according 
to anyone of claims 1 to 5, a nucleic acid according to claim 6, a proteinaceous 
molecule or fragment thereof according to claim 9, an antigen according to claim 10 
and/or an antibody according to claim 11. 

17. Use of a virus according to any one claims 1 to 5, a nucleic acid according to 
claim 6, a vector according to claim 7, a host cell according to claim 8, a proteinaceous 
molecule or fragment thereof according to claim 9, an antigen according to claim 10, 
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or an antibody according to claim 11 for the production of a pharmaceutical 
composition. 

18. Use according to claim 17 for the production of a pharmaceutical composition 
5 for the treatment or prevention of a SAKS virus infection. 

19. Use according to claim 17 or 18 for the production of a pharmaceutical 
composition for the treatment or prevention of atypical pneumonia. 

"10 20. A pharmaceutical composition comprising a virus according to any one claims 
1 to 5, a nucleic acid according to claim 6, a vector according to claim 7, a host cell 
according to claim 8, a proteinaceous molecule or fragment thereof according to claim 
9, an antigen according to claim 10, or an antibody according to claim 11. 

15 21. A method for the treatment or prevention of a SARS virus infection comprising 
providing an individual with a pharmaceutical composition according to claim 20. 

22. A method for the treatment or prevention of atypical pneumonia comprising 
providing an individual with a pharmaceutical composition according to claim 20. 

20 

23. A viral replicase encoded by an RNA sequence comprising the sequences EMC-' 
1, EMC-2, EMC-3, EMC-4, EMC-5, EMC-6, EMC-7 and/or EMC-13 as depicted in 
figure 2. 

25 24. A viral spike protein encoded by an RNA sequence comprising the sequence 
EMC-7 as depicted in figure 2. 

25. A viral nuclear capsid protein encoded by an RNA sequence comprising the 
sequence EMC-8 as depicted in figure 2 
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32 EPO - DO 1 

Abstract ° « " ^ 

a 

The invention relates to the field of virology. The invention provides an 
isolated essentially mammalian positive-sense single stranded RNA virus 
(SARS) within the group of coronaviuses and components thereof. 
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Figure 2 RNA sequences, implied polypeptides and alignment with one close 
relative 



EMC-1 
GCAGCI_ 

UGGCCUC^UCX^CUUAG^ 



UCGAGACGCCCGUUGAUJ? 

UGGCCUCAUGCUCUUAG? 

CUGraCUACaAACAAU^ 
GAGAAGAUAC^GUUUGGGAAGUUCAGG^UUAC 

CUGAUC 
Translation 

Nucleotides £4§3; 164 aa translation 

FACWAEAWKTLQPVSD 
Alignment 

Bovine Coronavirus RNA-Dependent RNA polymerase 
Identities = 39/159 (24%), 

Sbj ct: 826 AGoLYPV^DGHV G LL^ R W R VPCAG--R R V T r KEQ P-rVH E IA8TPKTIKV F YELDKDF 883 

Ouery: 554 D™""™?""™? 0TW«nm 670 
Sbjct: 884 OTIUWACGVFEVDDTVM4BEFYAWVDAIEBKLSPCKE 922 




Translation 
Alignment 

> Bovine Coronavirus RNA-Dependent RNA polymerase 
) Length = 7094 

Identities = 21/65 (32%) 

Query 4 SLGPTYIiDGADVTKIKPHVNHEGKTFE^LPSDDTLRSERFBYYHTLDESFLGRyMSRLNH 183 
S 1590 l^^lAc^l^^O^S^^^ 1- 
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Query: 184 TKKWK 198 
KW+ 

Sbjct: 1650 CSKWQ 1654 



A^IAAAUGAAAUCAtHJGGUUA 



CUGAUGAUUGUUUUGO^UAAACAUGCUGG^ 

UUCAl " * — ~ — ^tt.tt^ttt™, aa 



UUCAUACAAAAAUGACAAAAGCUGCCOTG^G3GOT 



:CUGGCUUACCC^UACUGUGCU^^^^ 
3UGUUUUUAGUGCI 
GUAUAGUGAUUUUGCUACCUCU 

Translation 

ATS 

^MSine hepatitis Virus RNA- Dependent RNA polymerase 
Identities 48/126 (38%), 

Query: 78 TC^UUKT^^ 251 
Sbjct: 2859 WRIMPTYRVHKSDMQLPLYASFKVIDNG\^RDVSVTDACFJVNKFNQFDQWYESTFGIAY1f 2918 



SD^Ct: W«W»ri*rtvn»^w- — 

Query: 252 KNDKSCPWAAIITREIGFIVPGLPGTVI»RAINGDF^FLPRVFSAV<3NICYTPSKLIE^f 431 
uuc Y .„ k+cpw A+I ++IG + +P TVLR I*HF+ F+ CYTP I J 

Sbjct: 2919 ^S^CPVVTOVIDQDlGHTLFlWPl^VLR-yGFHVLHFITHAFaTDSVQCYTPHMQIPY 2977 



Query: 432 SDFATS 449 
+F S 

Sbjct: 2978 DMFYAS 2983 



UGCUGCUGUUAUCAAUGGUGAUA 
Translation 

Alignment 

RNA-directed RNA polymerase murine hepatitis virus 

Identities = 122/222 (54%) 

Query 8 SITSlWWJBGFBKmrPSfflt^^ " 7 

S^t: 332, S^SCX^^ »» 

Q^y: 188 367 

Sbjct: 3386 m£!L££^^ 3445 
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Query: 368 SVLACYNGSPSGVYQCAMRPNHTIKGS FLNGSCGSVGFNI DYDCVS FC YMHHMEL PTGVH 547 

+VLA YNG P G + +R +HTIKGSFL GSCGSVG+ + D V F YMH +EL TG H 
Sbjct: 3446 TVIAAYNGRPQGAFHVTIiRSSHTIKGSFIjCGSCG^ 3505 

5 

Query: 548 AGTDLEGKFYGPFTORQTAQAAGTDTTITLNVLAWIiYAAVXN 673 

GTD G FYGP+ D Q Q D T T+NV+AWLYAA+ N 
Sbjct: 3506 TGTDFSGNFYG PYRDAQWQLPVQDYTQTVNWAWIjYAAI FN 3547 

10 

EMC-5 

Note that this sequence is not fully in frame. 
AGUUGGAAAAGAUGGCAGAUCAGC^^ 
1 5 CAAGAGGGGAAAAGUAACTIAGUGCUA 

GAUAAUGAUGCACUUAAC^CAUUAUC^ 
UC^UACCAUUGACtmC^GC^ 

GAACACUUGUGAUGGUAACACCUUUACAXIAUGCAUCUGCAOT 
GUUGAUGCGGAUAGCAAGAUUGUUCAACIJUAGUGAAA 
2 0 UGGCUUGGCCCCUUAUUGUUACAGCUCUAAGAGCCAAOJCAGOT 
UGAACUGAGUCCAGUAGCACUACGACAGAUGUCCUGUG 
UGUACUGAUGACAAUGCACUUGCCUACUAUAACAAUUCGAAGC^ 
CAUUACUAUCAGACCACCAAGAUOTCA 
UACAAUUUACACAGAACUGGAACC^CCUUGUAGGU^ 

2 5 AAAGUGAAAUACUUGUACUUC^U^ 

CAGUUUAGCUGCUACAGUACGUCUUC^GGCUGGAAAUGCU 

ACUGUGCUUUCC^JUCUGUGCUUUUGCA 

GCAAGUGGAGGACAACCAAUGACCAA 

GACAGGCAAUUACUGUAACACCAGAAGCUAACAUGGAC CAAGAGUC CQYUGGUGGUGCUUC 

3 0 AUGUUGUCUGUAUUGUAGAUGCCAC^^ 

AAAGGUAAGUACGUCCAAAUACCUACCACUUGUGCUAAUG^ 

GAAACACAGUCUGUACCGUCUGCGGAAUGUGGAAAGGUUAUGGCUGUAGUUGUGACCAAOT 
CCGCGAACCCUUGAUGC^GUCUGCGGAUGC^UC^ 

GUGCAGCCCGUCUimCACCGUGCGGCACAGGCACUAGIJACUGAUGUCGUCUAC^ 

3 5 UGAUAUUUACAACGAAAAAGUUGCUGGUUYU 

Translation 1 

Nucleotide 3-701 ; 233 aa 

LEKMADQAMTQMYKQARSE DKRAKVTSAMQTML FTMLRKLDNDALNN I INNARDGCVPLNI I PLTTAAKLMW 
40 VPDYGTYKNTCDGNTFTYASALWEIQQWDADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKLQNNELSP 
VALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDT 
PKGPKVKYLYFIKA 

Translation 2 

4 5 FXRVCGVS A- ARLTPCGTGTSTD WYRAFD I YNEKVAGXAKFLK 

Alignment 1 

RN A- Dependent RNA Polymerase: bovine coronavirus 
Identities = 181/413 (43%), 

50 

Query: 3 LEKMADQAMTQMYKQARSEDKRAKVTSAMQ TML FTMLRKXXXXXXXXXXXXXRDGCVPLN 182 

LE+MAD A+T MYK+AR DK++KV SA+QTMLF+M+RK GCVPLN 
Sbjct: 3985 LERMADLALTNMYKEARI NDRKS K WSALQTML FSM VRKXrDNQALN S I L DNAVKGCVPLN 4044 

55 Query: 183 IIPLTTAAKLMVWPDYGTYECNTCDGNTFTYASALWBIQQWDADSKIVQLSEINMDNSP 362 
IP A L ++VPD Y D TYA +W+IQ + D+D QL+EI+ D + 
Sbjct: 4045 AIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDSDGTNKQLNEISDDCN- 4103 

Query: 363 NLAWPL IVTALRAN — SAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALA YYNNSKGG 536 
60 WPL++ A R N SA LQNNEL P L+ +G QT T YYNNS G 

Sbjct: 4104 WPLVIIANRHNEVSATVLQNNELMPAKLKTQWMSGPDQTCNTPTQ — CYYNNSNNG 4158 

Query: 537 RFVLALLSDHQDLKWAR F PKSDGTGTI YTELEPPCRFVTD TPKGPKVKYL YFIKA * TT * I 716 
+ V A+LSD LK+ + K DG + EL+PPC+F KG K+KYLYF+K T 

65 Sbjct: 4159 KIVYAILSDVDGLKYTKILKDDG-KFWLELDPPCKFTVQDVKGIiKIKYIiyFVKGCNTLA 4217 
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Query: 717 EVWCWAV*LLQYVFRL EMLQK YL P I QLCFP S VLLQ* TLLKH I T 878 

Sbjct: 4218 R — GWVVGTISSTVRLQAGTATEYASNSSILSLCAFSVDPKKTYL DFIQQGGTPIA 4271 

Ouerv 879 {jcvKMLCTHTGTGQAITVTPEANMDQJSSXGCSa^ 1058 
Query. 879 H CT g AITV P+A +Q+S GGAS C+YCR ++HP+ G C L+GK+VQ+ 

Sbjct: 4272 NCVKMIXDHAGTGMAI TVKP DATTNQDS YGGAS VC I YCRARVEH^ 4331 

Ouerv 1059 PTTCANDPVG FTI*RNTVCTVCGMWKGYGCSCDQI*RE PLMQS ADASXFkHGFAV 1217 
Query. 1U!>3 ^ ArWiW Dpv + L + vc VCG W+ CSC + +QS D + FLNGF V 
Sbjct: 4332 p VG- 1 KDPVS YVLTHDVCQVCGFCTRDGSCSCVS -T DTTVQS KDTN- FLNGFGV 4381 

A R^-dTrecte/RNA polymerase (ORF1B) [murine hepatitis virus] 
Identities = 24/44 (54%), 

Ouerv 1199 FKRVCGVS A-ARLTPCGTGT STDVVYRAFDI YNEKVAGXAKFLK 1327 

FKRV G S AKL PC +G TDV RAFDI N AG + K 
Sbjct: 18 FKRVRGT S VNARLVPCASGXDT DVQLRAFDI CNANRAGI GI» Y YK 61 

EMC -6 m __ , _ 

ttrnte* that this sequence is not fully in irame. 
GUACAAUUOTGCGAUGCUAUGCGU 

AGTOC^UUGUGGAUUCAUAUUACUCAUUGCUGAU^ 

AaCUUUAAUGUGUUAUUUUCUACUGUGUUUc^ 
AAAUAUUUGUAGAUGGUGUUCCUUCUGUUGUUUCAACUGGAUA 

^CGUAC^UAAUC^GGAUGUA^ 
GUGUAUGCUGOTGAUCCAGCUAUGCAUGCAGCXTU 
OTACAUGCUUUUCAGUAGCUCCACUAACAAACAAU^ 
UAAUUUUAAUAAAGACUUUUAUGACUUUGOTGUGUOTA^ 

Translation 1 

LFSTVFPPTSFGPLVRKIFVDGVPSWSTGYHFRELGWHNQDVNLHSSRLSFKELL^ 

Translation 2 

656 to 772: Frame 2; 39 aa 

LLDKRTTCFSVAPLTNNVAFQT VKPGN FNKDFYDFAVSK 

Alignment 

ORFlab polyprotein Murine hepatitis virus 
Identities - 157/257 (61%), 

Ouerv 2 DILRVYANLGERVRQSLLKTVQFCDA^ 181 
Query. Z DI+VY LG ++LL T +F DA+ +AG+VGVLTLDNQDL G WYDFGDFV+ PGC 
Sbjct: 4626 DI I ^ YKKIjGPI FNRALLNTAKFADALVBAGLVGVLTLDNQDLYGQW YDFGDFVKTV PGC 4685 

Ouerv 182 GVPIVDSYYSI*LMPILTLTRAIJU\ESHMDADLAKPLIKWDl*IiKHDFTEERiCL 361 
Query. 1BZ ^ V DSYYS +MP+LT+ al +E ++ + +DL+++DFT+ +L LF +YFK+ 
Sbjct: 4686 GVAVADS Y Y S YMMPMI/TMCHALDS EL FVNGT YRE FDLVQYDFTDFKLELFTKYFKH 4741 

Query 362 W DQT YHPNC INCLDDRCI LHC AN FN VL FST VFP PTS FG PLVRK I EX^X^XX^? ?2 541 
Query. so* TYHPN C DDRCI+HCANFN+LFS V P T FG PLVR+ 1 FVDGVP WS GYH++ 

Sbjct: 4742 WSMTYHPNTCECEDDRCIIHCANFNILFSMVLPKTCFGPLVRQIFVTDGVPFVVSIGYHYK 4801 

Query: 542 ELG WHNQDVNLH S S RLS FKELL VYAADPAMHAASGN * LL ^ 

ELGW N DV+ H RLS K+LL+YAADPA+H AS + LLD RT ™I * ocl 

Sbjct: 4802 El^VVMNMDVDTHRYPXSLKDLLLYAADPALHVASASALLDLRTCCFSVAAITSGVKFO/T 4861 



38 



Query: 722 VKPGNFNKDFYDFAVSK 772 

VKPGNFN+DFY+F +SK 
Sbjct: 4862 VKPGNFNQDFYEFILSK 4878 

EMC- 7 This sequence also contains the beginning of the next 
ORF. 

ACCUUCAGAAODAUGGUGAAAADGCUGUDAUACCACAAGGAAOAADGAUGAAUGUCGCAAAGUAUACaCAACU 
GUGUCAAOACUUAAAUACACUUACODUAGCOGUACCCUACAACAUGAGAGUUAUUCACUUUGGUGCUGGCDCU 
GAUAAAGGAGOUGCACCAGGUACAGCOGUGCDCAGACAAUGGUUGCCAACUGGCACACUACUUGUCGAODCAG 
ADCUUAAaGACUUCGUCUCCGACGCAGAUDCOACUDOAAUUGGAGACUGUGCAACAGUACAOACGGCaAAUAA 
ADGGGACCUDAUOAUUAGCGAUAUGUAUGACCCOAGGACCAAACAUGUGACAAAAGAGAAOGACOCUAAAGAA 
GGGOUODUCACUDAUCDGUGUGGAUUUAUAAAGCAAAAACUAGCCCUGGGUGGOUCUAUAGCUGUAAAGAUAA 
CAGAGCAOUCUOGGAAUGCUGACCUUUACAAGCDUAUGGGCCAUOTCUCADGGOGGACAGCUUUOGDDACAAA 
UGUAAAUGCAUCAUCAUCGGAAGCAUUUUOAAUUGGGGCUAACUAUCUUGGCAAGCCGAAGGAACAAAUaGAU 
GGCUAUACCAUGCAUGCUAACUACAUUUDCUGGAGGAACACAAAUCCUAUCCAGUDGUCUUCCUAUDCACUCU 
UUGACAUGAGCAAAUUUCCUCUUAAAUUAAGAGGAACUGCOGUAAUGUCOCaUAAGGAGAAUC^^DCAADGA 
UAUGAUUUAUUCUCUUCUGGAAAAAGGUAGGCUDADCAUDAGAGAAAACAACAGAGUOGOGGUUUCAAGDGAO 
AUUCOUGUUAACAACUAAACGAACAUGOUDAOUDUCDUAUUAUUUCUUACUCDCACUAGUGGUAGUGACCOaG 
ACCGGUGCACCACUUUUGAUGAUGUUCAAGCOCCUAAUUACA 

Translation 1 

Nucleotides 3 to 818: Frame 3 272 aa 

LQNYGENAVIPQGIM^VAKYTQLCQYI^TLTLAV 
DSTLIGDCAWHTANKWDLIISDMYDPRTKHVTKENDS 
WWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHA1TCIFWRN 
MI YSLLEKGRLI IRENNRVWS S D ILVNN 



Translation 1 

Nucleotide 2 to 673: Frame 2; 224 aa (orf lab) 

DKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPRTPCHVTKENDSKE 
GFFT YLCGFIKQKLALGGS I AVKI TEHS WNADL YKLMGH FS WWTAFVTN VNAS S SEAFL I GAN YLGKPKEQI D 
GYTMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIY 
ILVNN 

Translation 2 

Nucleotide 683-772 (frame 2) 32aa (S protein) 
MFIFLLFLTLTSGSDLDRCTTFDDVQAPNY 

Alignment 1 

replicase [bovine coronavirus] 
Identities - 183/271 (67%), 

Query: 3 LQN YGENAVI PQGIMMNVAKYTQLCQYLNTLTLAVP YNMRVI HFGAG S DKGVAPGTAVLR 182 

L NYG+ +P G MMNVAKYTQLCQYLNT TLAVP NMRV+H GAGS+KGVAPG+AVLR 
Sbjct: 6822 LWNYGKPVTLPTGCMMNVAKYTQLCQYLN^ 6881 

Query: 183 QWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPRTKHVTKENDSK 362 

QWLP GT+LVD+DL FVSD+ +T GDC T+ +WDLI ISDMYDP TK++ + N SK 
Sbjct: 6882 QWLPAGTILVDNDLYPFVSDSVATYFGDCITLPFDCQWDLIISDMYDPITKNIGEYNVSK 6941 

Query: 363 EGFFTYLCGFIKQKIiALGGSIAVKITEHS^ADLYKLMGHFSWPilTAFVTNVNASSSEAFL 542 

+GFFTY+C 1+ KLALGGS+A+KITE SWNA+LYKLMG+F++WT F TN NASSSE FL 
Sbjct: 6942 DGFFTYICHMIRDKLALGGSVAIKITEFSWNAELYKLMGYFAFWTVFCTNANASSSEGFL 7001 

Query: 543 IGANYI^KPKEQIDGYTMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQ 722 

IG NYLGKPK +IDG MHANY+FWRN+ +YSLFDM+KFPLKL GTAV++L+ +Q 

Sbjct: 7002 IGINYLGKPKVEIDGNVMHANYLFWRNSTVWNGGAYSLFDMAKFPLKLAGTAVINLRADQ 7061 

Query: 723 IN DMI YSLLEKGRLI I RENNRVWSSDILVN 815 

INDM+YSLLEKG+L++R+ N+ V D LVN 
Sbjct: 7062 INDMVYSLLEKGKLLVRDTNKEVFVGDSLVN 7092 
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Alianment 2 (Spike protein of coronavirus) 
HCV OC43 MFLI LLI S LPTAFAVI GDL-KCTTVS I NDI D 

MHV A59 MLFVFILFLPSCLGYIGDF-RCIQLVNSNGA 
™" MFLILLISLPMAFAVIGDL-KCTTVSINDVD 

MF-I FLLFlr- TLTSG-SDIiDRCT3?FDDVQAP 

MGCCAAAACAGCGCCGAC^ 



GCAAGGAGGAACUUAGAOUCCCOCGA^ 

vccc 

ougc^cdg^gggagccougaaoacaSco^ 



Translation 

PEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNHNXA 

nuclfocapsid protein - bovine coronavirus (strain Mebus) 
Identities = 55/129 (42%), 

Query: 1 ^^PQ^A"---^^^ 162 
Sbjct: 44 QPKOTATSQLPSGGNVVPYYSWFSGITQFQKGKEFE^GQGVPIAPGVPATEAKGTOYR 103 
Ouerv 163 RTP^-VRGGDGKMKELSPRMYFYYI^TGPEASLPYGANKEGIVWVATEGA-LOTPKDHIG 336 

Query. 163 rtkk v ^ ,3 ^' us ' ++l PRWYFY yi.gtgp a yg + +g+ wva+ a +ntp d i 

Sbjct: 104 HI^reTAlXWQRQia.PRWYFrVI^GPHAKDQYGTDIDGVF^HQADVNTPAD-IL 162 

Query: 337 TRNPNNNXA 363 

R+P+++ A 
Sbjct: 163 DBDPSSDEA 171 




Translation of putative open reading frames 
>>~onT- ♦ 9 to 178: Frame 2 59 aa 

DTOY^PDDF^IAWNTRNIDATSTGNYNYKYRYl^GKIJlPFERDISNVPFSPDGK 

s^ nn t. 3 to 17 6: Frame 3 58 aa 

1 1 IINCQMISWWSLLGIUST^ 
>~out: 87 to 13: Frame -2 25 aa 

LPVEVASMFLVFQARTQPMKSSGNL 
>-out: 178 to 80: Frame -1 33 aa 

FAIRGBRHIRYVSLKGPKLAMSKIPIFIIIITS 
>~out: 177 to 97: Frame -2 27 aa 

LPSGEKGTLDMSLSKGLSLPCLRYLYL 

EMC- 10: unknown sequence (minimal homology to NADH 

AUGGGATGGG 

Translation of putative open reading frames 
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v~-m<- . i f- n 519: Frame 1 173 aa 

^FnnvOAPNYTOHTSSMRGVYYPDEI FRSDTLYLTQDLFLPFYSNVTGFHTINHT FGN PVI PFKDGI YFAATEKSNWRGW 
REFVFKNKDGM 

>~out: 104 to 259: Frame 2 52 aa 

LRIYFFHFILMI^FILLIIRIATLSYLI^MVFIIO.PQRNQMLSVVGFLVLP . 

>-out: 222 to 115: Frame -3 36 aa 

FLCGSKINTILKRYDRVAKRMINSMKPCNIRIKWKK 

>~out: 192 to 290: Frame 3 33 aa 

GWYLFCCHREIKCCPWLGFWFYHEQQVTVGDYY 

. t-r» 204: Frame —1 106 aa 

PIPSLkNTOSWCCI^^ 

DCDLLSWVEPKTQPRTTFDFSVAAK 

>~out* 456 to 274: Frame -3 61 aa 

NIKRKGIRYVIiESAIKCI IEYHSMCLCTHGFRNSKERVVTQFKVTCSYNN I SRIVNNNHRL 
>-out: 284 to 391: Frame 2 36 aa 

LLLTILLMLLYEHVTLNCVTTLSLLFLNPWVHRHIL 

>~out: 395 to 520: Frame 2 42 aa 

YSIMHLIALSSTYLMPFRLMFQKSQVILNTYESLCUCIKMGW 

EMC- 11a* unknown secruence 

UUGCCDaGGOOCACUGAGCGCUCOGAUAAGAGCUACGAGCACCAGACACCCOOCGaAAUUAAGAG 
AAaSuGScACOUOCAAAAGGGGAAUGCCCCAAAGCOOG^ 

CCACGUGUUGAAAAGAAAAAGACUGAGGGDUUCAUGGGGCGOAUACGCUCOGUGUACCCUGOO 
AGGAGUGUAACAAUAUGCACUUGUCUACCUUGA 

?gaHuuuctoaaagccacougugaa(^dogoggcacdgaaaauoda 

gSguugcagaouaucacaaccacdcaaaca^^^ 
ggcucaggccaoacuggcauoacoggogacaauguggaga ! 

Translation of putative open reading frames 
>~out: 78 to 1: Frame -3 26 aa 

dfrschsydhrlpyyrsyeeehcgmq 

lwhdmcsydi^deSdpS^ 
eqld y ie skrgvyccrdhehe i awvh 

>~out* 283 to 89: Frame -2 65 aa 

LARACEKIFDAIKRVPIWATEIWDIASDCTSIELTSEFTECTTAMLSVPVLFIIFNGISAKLVT 

^~rmi- ■ 90 to 614: Frame 3 175 aa _ 

VTSLALIPLKIMNKTGTLSMANTVHSVNSLV^ 

HLHRSVTICTCLP 

>~out: 204 to 124: Frame -3 27 aa 

RVTAPPLSSRVSSRSAPLPCLVFQFCS 

>~out: 312 to 208: Frame -3 35 aa 

SSCSERVHIDLPARARKSLMQSRGYPSGPQKLLST 

>~out* 485 to 258: Frame —1 76 aa 

EETQALGHSPFESVNFLGTLNFEGCLVLVALIRALSEPRQ 

>-out: 397 to 287: Frame -2 37 aa 

LLSERSVNPGNFMLMVTAAVDTSLRLDVIKLFGKSAH 

>~out: 364 to 486: Frame 1 41 aa 

NCLGSLSALIRATSTRHPSKLRVPRKLTLSKGECPKACVSS 
>-out: 4 90 to 4 01: Frame -2 30 aa 

VKRKHKLWGI PLLKVS I FLALLI SKGVWCS 
>-out: 446 to 988: Frame 2 181 aa 




RASADIGSGHTGITGDNVE 
>~out: 64 3 to 4 94: Frame -2 50 aa 

SFIAMITFHQGRQVHIVTLLWRCNRVHRAYTPHETLSLFLFNTWLNDFDF 

>~out: 627 to 511: Frame -3 39 aa 

LH FI KVDKCI LLHSCGDATGYTERIRPMKPS VFFFSTRG 



>~out: 704 to 612: Frame -1 31 aa 

LNFQCHNVHKWLSESRTSAMKLHRNDYISSR 

>~ouf 774 to 631: Frame -3 48 % aa- 
QAGHGIFTTALVGRYPHVIiGPSITKFSVPQCSQVAFRKS&VCHEASSQ 

>~out: 826 to 737: Frame -2 30 aa 

WV 1 1 CNTMirRSN LWVLT GRTWH FH Y S I S R 

>~out: 863 to 744: Frame -1 40 aa 

SYLPCGVEFQCLSGCDNliQHYAQVQSLGLDRQDMAFSLQH 

. TRfi *-r% 989: Frame 3 78 aa 

KCWLPVCTQRLDLS^ 

>~out: 952 to 830: Frame -2 41 aa 

ANISTSTRNPVGTLIIAANIGKHTASKTSSPTSLAESSFNV 

TTMP-iTb- unknowri secruence 

TCUO^aGAGGAaCUCCUOGAGAUACDGAGUCGaGAACGOGDOAACAUUAACAOUGUDGGC^UDUO^DOO 
r^U^GAGGUUGCCAaCAYGUUGGCAUCYUUCDCOGCDUCaACAAGDGCCOUaAUDGACACDAUAAA 

r^GA^ACAAGU^ 

AAGGOGCODGGAACAUUGGACAACAGAGAUCAGObOUAACACCACOGUGU 

^oSauSSadc^duougcgcgcacacodc^dgcagcaaaccacu 

goSScaSaSogIogguaoudcdgaacagocauuacgucuug^ 

ucaccaacagdgocauuauoauggcaoaogoaacugguggocuugoacaacagacu 



QT 

>~out: 15r to 71: Frame -2 27 aa 

SRLFIVS IKALVEAEXDAKXMATS SFK 

>~out: 142 to 216: Frame 1 25 aa 

rvlitslskpllspavtikiipresp 
^s^o^ps^^ 

>~out: 197 to 322: Frame 2 42 aa 

SYQGKARKRCLEHWTTEISFNTTVWFSLTGCWCYQINFCAHT 

>-out: 294 to 202: Frame -3 31 aa 

HQQPVRENHT WLKLI S WQCSKHLLRAFPW 

•>~rmt- • 259 to 4 68: Frame 1 70 aa 

^CV\^PHRLIiVLSDQFLRAHLMQQTTO 

>-out: 392 to 4 93: Frame 2 34 aa 

TVITSCRRHGLYFRPAHQQCHYYGICNWWSCTTD . 



UGCUUGCUCAUGCUGAAGAGACAAG 
AAUGGCAACCAUCCAACGUAAGUAUAAAGGAAU^^ 

GGUGUCCGAUUCUUCUUUUAUACUAGUAAAGAGCCUGUAGCUUCUAUUAUUACG 

ACUCUCUAAAUC^GCCGCUUGUC^C^U^^ 
UGAAGAGGCUGCGCGCUGUAUGCGUUCU^ 

CCAGAUGCUGUUACUAC^UAUAAUGGAUAGCUCACUUCGUC^^ 

ACUUUGUAGAAACAGUUUCIJUUGGCUG 

UACAGAGUUAGGUGUUGAA 

Translation of putative open reading frames>~out: 3 to 446: Frame 3 

LAHAEETRKLMPICMDVRAIMATIQ 

EEAARCMRSLKAPAWSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTELGVE 

>~out: 100 to 11: Frame -2 30 aa 

ILIPLYLRWMVAIMALTSMHIGINFLVSSA 

>~out: 188 to 33: Frame -1 52 aa 

RVQLRNNRSYRLFTSIKEESDTIVNDALLNFNSFILTLDGCHYGSNIHAYRH 

>~out: 64 to 159: Frame 1 32 aa 

WQPSNVSIKELKFKRASLTMVSDSSFILVKSL 
>~out: 220 to 143: Frame -2 26 aa 

P IGI VT SGS FREFS FVI I EATG S LLV 
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>-out: 293 to 192: Frame -1 34 aa 

HYGRSFKRTHTARSLFKIKTMCHITNWHCDKRLI 
>-out: 397 to 224: Frame -2 58 aa 

EPAKETVSTKCSSDVFDDEVRYPLYVVTASGDDTDTTAGAI^RIQRAASSIU^CVT 
>-out: 229 to 288: Frame 1 20 aa 

HMVLI LKRLRAVCVLLKLLP 

>-out: 292 to 372: Frame 1 27 aa 

CQYHHQMLLLHIMDTSLRHQRHLRSTL 

>~out: 444 to 340: Frame -3 35 aa 

QHLTLYAVLNRTNLCKSQPKKLFI^SAPQMSLMTK 

>-out: 416 to 351: Frame -1 22 aa 

IGPISVRASQRNCFYKVLLRCL 

>-out: 365 to 445: Frame 2 27 aa 

GALCRNSFFGWLLQRLVLFRTAYRVRC 

>~out: 376 to 435: Frame 1 20 aa 

KQFLWLALTEIGPIQDSVQS 



EMC13: 

CUGAAGAAGUAGUGGaA7UVUCCDACCAOACAGAAGGAAGUCAOAGAGUGUGACGUGAAAACUACCGAAGDUGU 
AGGCAAUGUCA0AC0UAAACCAUCAGACGAAGGOGUUAAAGUAACACAAGAGUUAGGUCAUGAGGAUCUUADG 
GCU GCU U AUGUGGAAAACACAAGCAU UACCAUUAAGAAACCUAAUGAGCUOUCACU AGCC UOAGGUU UAAAAA 
CAADUGCCACUCAUGGOAOUGCUGCAAOUAAOAGUGOOCCUUGGAGUAAAAUUUUGGCUDAaGOCAAACCAUU 
CUUAGGACAAGCAGCAAOUACAACAUCAAAUUGCGCOAAGAGAOUAGCACAACGUGUGUOUAACAAUUAUAUG 
CCUOAUGUGOUUACAUOAUUGUOCCAAUUGUGUACDDUUACUAAAAGUACCAAUUCUAGAAUOAGAGCaUCAC 
UACCUACAACUAUUGCDAAAAAUAGUGUUAAGAGUGOOGCUAAAUUAUGOUUGGAUGCCGGCAUUAAUUAUGU 
GAAGUCACCCAAAUUUOCUAAAOUGUUCACAAUCGCOAUGUGGCUAUUGOUGDUAAGUAUDUGCUUAGGUUCU 
CUAAUCUG9GU7UVCUGCUGCUUOUGGUGUACUCUUAUCUAAUOUUGGUGCDCCUUCUUAUUGUAAUGGCGUUA 
GAGAAUUGUADCUOAAOUCGUCDAACGOUACDACUAUGGAUUUCUGUGAAGGUOCUUUUCCUUGCAGCAUUUG 
UUUAAGUGGAOUAGACaCCCUUGADUCOUAOCCAGCUCODGAAACCAUUCAGGUGACGAUaUCAUCGUACAAG 
CUAGACUUGACAAOUOOAGGUCUGGCCGCUG 



Translation 

>~out: 3 to 833: Frame 3 277 aa 

EEWENPTIQKEVIECDVKTTEWGNVILKFSDEGVKVTQELGHED^ 

INSVPWSKIIAYVKPFLGQAAITTSNCAKRIiAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLM 

LDAGINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVEIELYLNSSNVTTMDFCEGSFPCSIC 
LSGLDSLDSYPALETIQVTISSYKLDLTILGLAA 



Alignment 

bovine coronavirus RNA-dependent RNA Polymerase 
Identities = 50/269 (18%), 

Query: 57 KTTEWGNVILKPSDEGVKVTQELGHEDLMAAYVENTSITIKKPNELSLALGLKTIATH- 233 

K +V +VI+ +K + L D+ ++ ++ N+LS+A+ + TI 

Sbjct: 204 6 KPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDmLTGCRYVVRTANDLSMAVNVPTIRKFI 2105 

Query: 234 — GIAAIN SVPW SKI -LAYVKPFLGQAAI TT SNCAKRLAQRVFN — NYMPYVFTLLF 389 

G+ + S+P + L +KP N K + ++ N++ +*F LLF 

Sbjct: 2106 KFGMTLV-S I PIDLLNLREIKPVF NWKAVRNKI SACFNFIKWLFVLLFGWI 2156 

Query: 390 QLCT FTKSTN SRIRASLPTT I AKNS VKS VAKLCL DAG INYVKS PKFSKIiFT I AMW 554 

+T S++ L KN+ + + G + + +W 
Sbjct: 2157 KI S ADNKVI YTTE VAS KLTCKLVALAFKNAFLT FKWS WARGACI I AT IFLLW 2209 

Query: 555 XXXXXXXXXXXXXVTAAFGVLLSNFGAPSYCNGVRELYLNSSNVTTM 695 

G L P++ + + ++ ++ T+ 

Sbjct: 2210 FN FI Y AN V I FS D F YLPKIG FL PTFVGKIAQWIKSTFSLVTICDIiYSIQDVGFKN 2263 

Query: 696 DFCEGS FPCS ICLSGLDS LDSY PALET IQ 782 

+C GS C CL+G D LD+Y A++ +Q 
Sbjct: 2264 QYCNGSI ACQFCL AGFDM LDNYK AID VVQ 2292 
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Figure 3. 
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Figure 4. 



Comparison of N-termini of the S proteins of the group 2 coronaviruses 

HCV OC43 MFLILLISLPTAFAVIGDL-KCTTVSINDID 

MHV A59 MLFVFILFLPSCLGYIGDF-RCIQLVNSNGA 

BCV MFL I LL I S L PMAF AV I GDL-KCTT VS IN DVD 

SARS MF-IFLLFL-TLTSG-SDLDRCTTFDDVQAP 
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